CONVERGENCE AND STABILITY PROPERTIES OF MINIMAL POLYNOMIAL
AND REDUCED RANK EXTRAPOLATION ALGORITHMS 
Avram Sidi* 

National Aeronautics and Space Administration 
Lewis Research Center 
Cleveland, Ohio 44135 

ABSTRACT 

The minimal polynomial and reduced rank extrapolation algorithms are two 
acceleration of convergence methods for sequences of vectors. In a recent sur- 
vey these methods were tested and compared with the scalar, vector, and topo- 
logical epsilon algorithms, and were observed to be more efficient than the 
latter. It was also observed that the two methods have similar convergence 
properties. The purpose of the present work is to analyze the convergence and 
stability properties of these methods, and to show that they are bona fide ac- 
celeration methods when applied to a class of vector sequences that includes 
those sequences obtained from systems of linear equations by using matrix iter- 
ative methods. 

1. INTRODUCTION 

The minimal polynomial extrapolation (MPE) and the reduced rank extrapola- 
tion (RRE) algorithms are two methods that have been devised for accelerating 
the convergence of sequences of vectors. In a recent survey carried out by 
D. A. Smith, W. F. Ford, and A. Sidi (unpublished) these two methods were 
tested and compared with the scalar, vector, and topological epsilon
algorithms. It was observed numerically that the MPE and the RRE have similar
convergence properties and are more efficient than the three epsilon algorithms.

In the present work we analyze the convergence and stability properties of the
MPE and the RRE, and show that they are bona fide convergence acceleration
methods when applied to a family of sequences that includes those sequences 
obtained from systems of linear equations by using matrix iterative methods. 

In a recent work [3] a general framework for deriving convergence acceler- 
ation methods for vector sequences has been proposed. Within this framework 
one can derive several methods, some old (including the MPE, RRE, and the top- 
ological epsilon algorithm), and some new (including a method that has been 
designated the modified MPE). The approach of [3] is formulated in general 
normed linear spaces of finite or infinite dimension. In the present work we 
use the formulations of the MPE and the RRE as they are given in [3]. We also 
make the assumption that the normed linear space in which the vector sequence 
is defined is an inner product space, with the norm being induced by the inner 
product. 

The plan of the present paper is as follows: In Section 2 we specify the
vector sequences whose convergence we are seeking to accelerate, and describe the
MPE and the RRE as they were formulated in [3], deriving at the same time
determinant representations for them. We shall also use Section 2 to introduce
much of the notation that we use in the remainder of this work. In Section 3 
we analyze the convergence properties of both methods and obtain actual rates 
of acceleration for them. In Section 4 we analyze their stability properties. 
The results of Sections 3 and 4 are helpful in explaining some of the numerical 
results obtained from the MPE and the RRE. The techniques used in the present 
work are similar in nature to those developed and used in [3] in the analysis 
of the modified MPE and the topological epsilon algorithm. The analysis in the 
present work, however, is considerably heavier due to the extreme nonlinearity 
of the MPE and the RRE. Surprisingly, all the conclusions that were drawn for
the modified MPE hold for the MPE and the RRE.

2. NOTATION AND DESCRIPTION OF ALGORITHMS


Let B be an inner product space defined over the field of complex numbers.
In this work we shall adopt the following convention for the homogeneity
property of the inner product. For $y, z \in B$ and $\alpha, \beta$ complex numbers, the
inner product $(\cdot,\cdot)$ is defined such that $(\alpha y, \beta z) = \bar{\alpha}\beta\,(y,z)$. The norm of a
vector $x \in B$ will be defined by $\|x\| = \sqrt{(x,x)}$.

Let us consider a sequence of vectors $x_i$, $i = 0, 1, \ldots$, in B. We
shall assume that

$$ x_m \sim s + \sum_{i=1}^{\infty} v_i \lambda_i^m \quad \text{as } m \to \infty, \tag{2.1} $$

where s and $v_i$, $i = 1, 2, \ldots$, are vectors in B, and $\lambda_i$, $i = 1, 2, \ldots$,
are scalars, such that $\lambda_i \ne 1$, $i = 1, 2, \ldots$, $\lambda_i \ne \lambda_j$ if $i \ne j$,
$|\lambda_1| \ge |\lambda_2| \ge \cdots$, and that there can be only a finite number of $\lambda_i$ whose
moduli are equal. Without loss of generality, we assume in (2.1) that $v_i \ne 0$,
$\lambda_i \ne 0$ for all $i \ge 1$. The meaning of (2.1) is that for any integer N > 0, there
exist a positive constant K and a positive integer $m_0$ that depend only
on N, such that for every $m \ge m_0$ the vector
$\bigl[x_m - s - \sum_{i=1}^{N-1} v_i \lambda_i^m\bigr] / \lambda_N^m$ satisfies

$$ \Bigl\| \Bigl[ x_m - s - \sum_{i=1}^{N-1} v_i \lambda_i^m \Bigr] \Big/ \lambda_N^m \Bigr\| \le K. \tag{2.1a} $$

If $|\lambda_1| < 1$, then $\lim_{m\to\infty} x_m$ exists and is simply s. If $|\lambda_1| \ge 1$, then
$\lim_{m\to\infty} x_m$ does not exist, and s is said to be the anti-limit of the sequence
$x_m$, $m = 0, 1, \ldots$. Our problem is to find a good approximation to s, whether
it be the limit or the anti-limit of the sequence, from a relatively small
number of the vectors $x_i$, $i = 0, 1, \ldots$.

Example. Let A be a nondefective $M \times M$ (complex) matrix and b an
M-dimensional (complex) vector, and consider the solution of the linear system
of equations

$$ x = Ax + b. \tag{2.2} $$

Let $\lambda_1, \ldots, \lambda_M$ and $v_1, \ldots, v_M$ be the eigenvalues and corresponding
eigenvectors of A. Assume also that 1 is not an eigenvalue of A so that (2.2)
has a unique solution, which we shall denote by s. For a given $x_0$, we
generate the sequence $x_j$, $j = 1, 2, \ldots$, by the matrix iterative method

$$ x_{j+1} = A x_j + b, \quad j = 0, 1, \ldots. \tag{2.3} $$

Let $x_0 - s = \sum_{i=1}^{M} \alpha_i v_i$ for some scalars $\alpha_i$. Then

$$ x_m = s + \sum_{i=1}^{M} \alpha_i v_i \lambda_i^m, \quad m = 0, 1, \ldots. \tag{2.4} $$

As is known, if $\alpha_1 \ne 0$, which will be the case for the given $x_0$ in general,
then $\lim_{m\to\infty} x_m = s$ provided $|\lambda_1| < 1$; otherwise s is the anti-limit.

Let us denote $u_i = \Delta x_i = x_{i+1} - x_i$ and $w_i = \Delta u_i = \Delta^2 x_i$,
$i = 0, 1, \ldots$.
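The expansion (2.4) in the Example can be checked numerically. The following is a minimal sketch, assuming NumPy and a small hypothetical test problem: the eigenvalues `lam`, the eigenvector matrix `V`, and the helper `iterate` are illustrative choices, not part of the original text.

```python
import numpy as np

rng = np.random.default_rng(0)
M = 5
lam = np.array([0.9, 0.6, -0.4, 0.2, 0.1])   # hypothetical eigenvalues, none equal to 1
V = rng.standard_normal((M, M))              # columns play the role of v_1, ..., v_M
A = V @ np.diag(lam) @ np.linalg.inv(V)      # nondefective by construction
b = rng.standard_normal(M)

s = np.linalg.solve(np.eye(M) - A, b)        # unique solution of x = A x + b, cf. (2.2)
x0 = rng.standard_normal(M)
alpha = np.linalg.solve(V, x0 - s)           # x0 - s = sum_i alpha_i v_i

def iterate(x, m):
    """Apply x <- A x + b exactly m times, as in (2.3)."""
    for _ in range(m):
        x = A @ x + b
    return x

m = 12
x_iter = iterate(x0, m)
x_formula = s + V @ (alpha * lam**m)         # right-hand side of (2.4)
print(np.linalg.norm(x_iter - x_formula))
```

The two vectors should agree up to rounding error, since (2.4) is an exact identity for this kind of sequence.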

The MPE. Let k be an integer less than or equal to the dimension of the
space B. The approximation $s_{n,k}$ to s is given by

$$ s_{n,k} = \sum_{j=0}^{k} \gamma_j x_{n+j}, \tag{2.5} $$

where the $\gamma_j$ are obtained from

$$ \gamma_j = \frac{c_j}{\sum_{i=0}^{k} c_i}, \quad j = 0, 1, \ldots, k, \tag{2.6} $$

with $c_k = 1$, and $c_0, \ldots, c_{k-1}$ being determined as the solution to the
minimization problem

$$ \min_{c_0, \ldots, c_{k-1}} \Bigl\| \sum_{j=0}^{k-1} c_j u_{n+j} + u_{n+k} \Bigr\|, \tag{2.7} $$

provided $\sum_{i=0}^{k} c_i \ne 0$.


For the example above, (2.7) is equivalent to the least squares solution
of the overdetermined system of M equations

$$ U_{n,k}\, c = -u_{n+k}, \tag{2.8} $$

where $U_{n,k}$ is the $M \times k$ matrix

$$ U_{n,k} = [u_n, u_{n+1}, \ldots, u_{n+k-1}], \tag{2.9} $$

and c is the column vector $(c_0, c_1, \ldots, c_{k-1})^T$, when
$(y,z) = y^* z = \sum_{i=1}^{M} \bar{y}^i z^i$, where $y = (y^1, \ldots, y^M)^T$ and
$z = (z^1, \ldots, z^M)^T$, and $y^*$ is the Hermitian conjugate of y. This is the way
the MPE was developed originally in [1].
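The least squares formulation (2.5)-(2.9) translates almost line by line into code. The following is a minimal sketch, assuming NumPy; the function name `mpe` and the test problem (eigenvalues `lam`, random eigenvector basis `V`, starting vector with all expansion coefficients equal to 1) are hypothetical choices made only for illustration.

```python
import numpy as np

def mpe(X, k):
    """MPE approximation s_{n,k} from the columns x_n, ..., x_{n+k+1} of X."""
    U = np.diff(X[:, :k + 2], axis=1)            # u_n, ..., u_{n+k}
    # Least squares solution of U_{n,k} c = -u_{n+k}, cf. (2.8)-(2.9)
    c, *_ = np.linalg.lstsq(U[:, :k], -U[:, k], rcond=None)
    c = np.append(c, 1.0)                        # c_k = 1
    gamma = c / c.sum()                          # (2.6); requires sum(c) != 0
    return X[:, :k + 1] @ gamma, gamma           # (2.5)

# Hypothetical test problem: x_{j+1} = A x_j + b with prescribed eigenvalues
rng = np.random.default_rng(1)
lam = np.array([0.9, 0.5, 0.1, 0.05])
V = rng.standard_normal((4, 4))
A = V @ np.diag(lam) @ np.linalg.inv(V)
b = rng.standard_normal(4)
s = np.linalg.solve(np.eye(4) - A, b)

n, k = 5, 2
xs = [s + V @ np.ones(4)]                        # x_0 - s = v_1 + ... + v_4
for _ in range(n + k + 1):
    xs.append(A @ xs[-1] + b)
X = np.column_stack(xs[n:n + k + 2])             # x_n, ..., x_{n+k+1}

s_nk, gamma = mpe(X, k)
err_mpe = np.linalg.norm(s_nk - s)
err_plain = np.linalg.norm(X[:, -1] - s)         # error of x_{n+k+1}
print(err_mpe, err_plain)
```

`np.linalg.lstsq` minimizes the Euclidean norm, which matches (2.7) when the inner product is $(y,z) = y^* z$. For this test problem the extrapolated error should be far smaller than the error of the last iterate used.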

We shall now give a determinant expression for $s_{n,k}$ that will be of
use in the remainder of this work. Since $\|y\| = \sqrt{(y,y)}$ for any vector y
in B, the $c_j$ that solve the minimization problem in (2.7) satisfy the normal
equations

$$ \sum_{j=0}^{k-1} (u_{n+i}, u_{n+j})\, c_j = -(u_{n+i}, u_{n+k}), \quad 0 \le i \le k-1. \tag{2.10} $$

Consequently, the $\gamma_j$ that are defined by (2.6) satisfy the equations

$$ \sum_{j=0}^{k} \gamma_j = 1, \qquad \sum_{j=0}^{k} (u_{n+i}, u_{n+j})\, \gamma_j = 0, \quad 0 \le i \le k-1, \tag{2.11} $$

provided these equations have a solution. Assuming that the determinant of the
matrix of equations (2.11) is nonzero, and using Cramer's rule, we can write
the solution of (2.11) as

$$ \gamma_j = \frac{N_j}{\sum_{i=0}^{k} N_i} = \frac{N_j}{D(1, \ldots, 1)}, \quad 0 \le j \le k, \tag{2.12} $$

where $N_j$ is the cofactor of $\sigma_j$ in the determinant

$$ D(\sigma_0, \ldots, \sigma_k) = \begin{vmatrix} \sigma_0 & \sigma_1 & \cdots & \sigma_k \\ u_{0,0} & u_{0,1} & \cdots & u_{0,k} \\ u_{1,0} & u_{1,1} & \cdots & u_{1,k} \\ \vdots & \vdots & & \vdots \\ u_{k-1,0} & u_{k-1,1} & \cdots & u_{k-1,k} \end{vmatrix}, \tag{2.13} $$

with $u_{i,j} = (u_{n+i}, u_{n+j})$, $i, j \ge 0$. In the first row of the determinant
$D(\sigma_0, \ldots, \sigma_k)$, the $\sigma_0, \ldots, \sigma_k$ are allowed to be vectors in B, in which
case (2.13) is to be interpreted as

$$ D(\sigma_0, \ldots, \sigma_k) = \sum_{i=0}^{k} \sigma_i N_i. \tag{2.14} $$

Combining (2.5), (2.12), and (2.14), we can now express $s_{n,k}$ as

$$ s_{n,k} = \frac{D(x_n, \ldots, x_{n+k})}{D(1, \ldots, 1)}. \tag{2.15} $$
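The agreement between the least squares definition of the $\gamma_j$ and the cofactor formula (2.12) holds for any set of vectors, so it can be checked on random data. The following is a minimal sketch, assuming NumPy; the random matrix `X` is only a stand-in for $x_n, \ldots, x_{n+k+1}$, and all variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(2)
M, k = 6, 3
X = rng.standard_normal((M, k + 2))              # stand-in for x_n, ..., x_{n+k+1}
U = np.diff(X, axis=1)                           # u_n, ..., u_{n+k}

# gamma_j from the least squares problem (2.7) and the normalization (2.6)
c, *_ = np.linalg.lstsq(U[:, :k], -U[:, k], rcond=None)
c = np.append(c, 1.0)
gamma_ls = c / c.sum()

# gamma_j from the cofactors N_j of the first row of D, cf. (2.12)-(2.13)
G = U.conj().T @ U                               # G[i, j] = (u_{n+i}, u_{n+j})
rows = G[:k, :]                                  # rows i = 0, ..., k-1 of (2.13)
N = np.array([(-1) ** j * np.linalg.det(np.delete(rows, j, axis=1))
              for j in range(k + 1)])
gamma_det = N / N.sum()                          # N.sum() equals D(1, ..., 1)

s_ls = X[:, :k + 1] @ gamma_ls                   # (2.5)
s_det = (X[:, :k + 1] @ N) / N.sum()             # (2.15), using (2.14)
print(np.max(np.abs(gamma_ls - gamma_det)))
```

The determinant form is convenient for analysis; for actual computation the least squares route is the better conditioned of the two, since forming `G` squares the condition number of `U`.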


The RRE. Let k be an integer less than or equal to the dimension of the
space B. The approximation $s_{n,k}$ to s is given by

$$ s_{n,k} = x_n + \sum_{i=0}^{k-1} q_i u_{n+i}, \tag{2.16} $$

where the $q_i$ are determined as the solution of the minimization problem

$$ \min_{q_0, \ldots, q_{k-1}} \Bigl\| u_n + \sum_{i=0}^{k-1} q_i w_{n+i} \Bigr\|. \tag{2.17} $$

For the example above, (2.17) is equivalent to the least squares solution
of the overdetermined system of equations

$$ W_{n,k}\, q = -u_n, \tag{2.18} $$

where $W_{n,k}$ is the $M \times k$ matrix

$$ W_{n,k} = [w_n, w_{n+1}, \ldots, w_{n+k-1}], \tag{2.19} $$

and q is the column vector $q = (q_0, q_1, \ldots, q_{k-1})^T$, when
$(y,z) = y^* z = \sum_{i=1}^{M} \bar{y}^i z^i$, where y and z are again
$y = (y^1, \ldots, y^M)^T$ and $z = (z^1, \ldots, z^M)^T$.
This is the way the RRE was developed originally in [2].

For this case too $s_{n,k}$ can be expressed as the quotient of two
determinants, and we turn to this now. Again by the fact that $\|y\| = \sqrt{(y,y)}$ for
any vector y in B, the $q_j$ that solve the minimization problem in (2.17)
satisfy the normal equations

$$ \sum_{j=0}^{k-1} (w_{n+i}, w_{n+j})\, q_j = -(w_{n+i}, u_n), \quad 0 \le i \le k-1. \tag{2.20} $$

Substituting $w_{n+j} = u_{n+j+1} - u_{n+j}$ on the left hand side of (2.20), and
rearranging, we obtain

$$ (w_{n+i}, u_n)(1 - q_0) + \sum_{j=1}^{k-1} (w_{n+i}, u_{n+j})(q_{j-1} - q_j) + (w_{n+i}, u_{n+k})\, q_{k-1} = 0, \quad 0 \le i \le k-1. \tag{2.21} $$

Let us define

$$ \gamma_0 = 1 - q_0, \qquad \gamma_j = q_{j-1} - q_j, \quad 1 \le j \le k-1, \qquad \gamma_k = q_{k-1}. \tag{2.22} $$

It is easily verified that

$$ \sum_{j=0}^{k} \gamma_j = 1, \tag{2.23} $$

so that (2.22) and (2.23) establish a one-to-one correspondence between the
$q_i$, $0 \le i \le k-1$, and the $\gamma_j$, $0 \le j \le k$. Consequently, the linear system
of equations (2.20) for the $q_j$ is equivalent to the linear system

$$ \sum_{j=0}^{k} \gamma_j = 1, \qquad \sum_{j=0}^{k} (w_{n+i}, u_{n+j})\, \gamma_j = 0, \quad 0 \le i \le k-1, \tag{2.24} $$

for the $\gamma_j$, by (2.21), (2.22), and (2.23). Similarly, substituting
$u_{n+i} = x_{n+i+1} - x_{n+i}$ on the right hand side of (2.16), rearranging, and
invoking (2.22), we see that $s_{n,k}$ for RRE (as for MPE) can be expressed in the
form (2.5). Assuming now that the determinant of the matrix of equations
(2.24) is nonzero, and using Cramer's rule, we can express the $\gamma_j$ that solve
(2.24) exactly as in (2.12), where $N_j$ is the cofactor of $\sigma_j$ in the
determinant $D(\sigma_0, \ldots, \sigma_k)$ given in (2.13) with $u_{i,j} = (w_{n+i}, u_{n+j})$, $i, j \ge 0$. Again
when $\sigma_0, \ldots, \sigma_k$ are vectors in B, $D(\sigma_0, \ldots, \sigma_k)$ is to be interpreted as
in (2.14). Consequently $s_{n,k}$, also for RRE, can be expressed as in (2.15).
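The RRE definition (2.16)-(2.19) and the change of variables (2.22) can be illustrated together. The following is a minimal sketch, assuming NumPy; the function name `rre` is hypothetical, and random data stands in for the vectors $x_n, \ldots, x_{n+k+1}$ because the algebraic identities being checked do not depend on how the vectors were generated.

```python
import numpy as np

def rre(X, k):
    """RRE approximation s_{n,k} from the columns x_n, ..., x_{n+k+1} of X."""
    U = np.diff(X[:, :k + 2], axis=1)            # u_n, ..., u_{n+k}
    W = np.diff(U, axis=1)                       # w_n, ..., w_{n+k-1}
    # Least squares solution of W_{n,k} q = -u_n, cf. (2.18)-(2.19)
    q, *_ = np.linalg.lstsq(W, -U[:, 0], rcond=None)
    s_nk = X[:, 0] + U[:, :k] @ q                # (2.16)
    # (2.22): gamma_0 = 1 - q_0, gamma_j = q_{j-1} - q_j, gamma_k = q_{k-1}
    gamma = np.concatenate(([1.0], q)) - np.concatenate((q, [0.0]))
    return s_nk, gamma

rng = np.random.default_rng(3)
M, k = 6, 3
X = rng.standard_normal((M, k + 2))              # stand-in for x_n, ..., x_{n+k+1}
s_nk, gamma = rre(X, k)

s_from_gamma = X[:, :k + 1] @ gamma              # the form (2.5)
U = np.diff(X, axis=1)
W = np.diff(U, axis=1)
residual = W.conj().T @ U @ gamma                # left-hand sides of (2.24), i = 0..k-1
print(np.linalg.norm(s_nk - s_from_gamma), np.linalg.norm(residual))
```

Both printed quantities should be at rounding level: the first confirms that (2.16) and (2.5) give the same vector, the second that the $\gamma_j$ obtained from (2.22) satisfy the orthogonality conditions in (2.24).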

Before closing this section we shall state a result that will be of use 
in the remainder of this work. 

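Lemma 2.1. Let $i_0, i_1, \ldots, i_k$ be integers greater than or equal to 1,
and assume that the scalars $v_{i_0, \ldots, i_k}$ are odd under an interchange of any
two indices $i_0, \ldots, i_k$. Let $\sigma_i$, $i \ge 1$, be scalars (or vectors) and let
$t_{i,j}$, $i \ge 1$, $1 \le j \le k$, be scalars. Define

$$ I_{k,N} = \sum_{i_0=1}^{N} \cdots \sum_{i_k=1}^{N} \sigma_{i_0} \Bigl( \prod_{p=1}^{k} t_{i_p,p} \Bigr) v_{i_0, \ldots, i_k} \tag{2.25} $$

and

$$ J_{k,N} = \sum_{1 \le i_0 < i_1 < \cdots < i_k \le N} \begin{vmatrix} \sigma_{i_0} & \sigma_{i_1} & \cdots & \sigma_{i_k} \\ t_{i_0,1} & t_{i_1,1} & \cdots & t_{i_k,1} \\ \vdots & \vdots & & \vdots \\ t_{i_0,k} & t_{i_1,k} & \cdots & t_{i_k,k} \end{vmatrix} \, v_{i_0, \ldots, i_k}, \tag{2.26} $$

where the determinant in (2.26) is to be interpreted in the same way as
$D(\sigma_0, \ldots, \sigma_k)$ in (2.13). Then

$$ I_{k,N} = J_{k,N}. \tag{2.27} $$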


For a proof of Lemma 2.1, see [3, Appendix]. 
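An identity of this type can be tested by brute force for small k and N. The following is a minimal sketch, assuming NumPy and assuming that the lemma equates an unrestricted multiple sum $\sum \sigma_{i_0} \bigl(\prod_{p=1}^{k} t_{i_p,p}\bigr) v_{i_0,\ldots,i_k}$ with a sum over $i_0 < i_1 < \cdots < i_k$ of determinants whose first row is $\sigma_{i_0}, \ldots, \sigma_{i_k}$ and whose remaining rows are built from the $t_{i,j}$. The antisymmetric scalars `v` are taken to be Vandermonde determinants, which is only one convenient example.

```python
import itertools
import numpy as np

rng = np.random.default_rng(4)
k, N = 2, 5
sigma = rng.standard_normal(N)                   # sigma_i (scalars in this sketch)
t = rng.standard_normal((N, k))                  # t[i, p-1] stands for t_{i,p}
xi = rng.standard_normal(N)

def v(idx):
    """Scalars odd under interchange of any two indices (example choice):
    a Vandermonde determinant in xi[i_0], ..., xi[i_k]."""
    return np.linalg.det(np.vander(xi[list(idx)], increasing=True))

# Unrestricted sum over all index tuples (i_0, ..., i_k)
I = sum(sigma[idx[0]] * np.prod([t[idx[p], p - 1] for p in range(1, k + 1)]) * v(idx)
        for idx in itertools.product(range(N), repeat=k + 1))

def det_block(idx):
    """Determinant with first row sigma_{i_0..i_k} and row p equal to t_{i_0..i_k, p}."""
    top = sigma[list(idx)]
    rest = t[list(idx), :].T
    return np.linalg.det(np.vstack([top, rest]))

# Ordered sum over i_0 < i_1 < ... < i_k
J = sum(det_block(idx) * v(idx)
        for idx in itertools.combinations(range(N), k + 1))
print(I, J)
```

The two sums should agree up to rounding. Tuples with a repeated index contribute nothing to the first sum because `v` vanishes there, and the remaining tuples group into permutations of an increasing tuple, which is what produces the determinant.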

3. CONVERGENCE ANALYSIS 

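We have seen in the previous section that $s_{n,k}$, for both the MPE and
the RRE, is given by

$$ s_{n,k} = \frac{D(x_n, \ldots, x_{n+k})}{D(1, \ldots, 1)}, \tag{3.1} $$

where $D(\sigma_0, \ldots, \sigma_k)$ is defined by (2.13) with $u_{i,j} = (u_{n+i}, u_{n+j})$, $i, j \ge 0$,
for MPE, and $u_{i,j} = (w_{n+i}, u_{n+j})$, $i, j \ge 0$, for RRE. Subtracting s from both
sides of (3.1), and making use of (2.14), we obtain the error formula

$$ s_{n,k} - s = \frac{D(x_n - s, \ldots, x_{n+k} - s)}{D(1, \ldots, 1)}. \tag{3.2} $$

Under the assumption (2.1), we have

$$ u_m \sim \sum_{i=1}^{\infty} v_i (\lambda_i - 1) \lambda_i^m \quad \text{as } m \to \infty, \tag{3.3} $$

and consequently,

$$ w_m \sim \sum_{i=1}^{\infty} v_i (\lambda_i - 1)^2 \lambda_i^m \quad \text{as } m \to \infty. \tag{3.4} $$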
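From (3.3) and (3.4) we see that

$$ u_{p,q} = (u_{n+p}, u_{n+q}) \sim \sum_{i=1}^{\infty} \sum_{j=1}^{\infty} (v_i, v_j)(\bar\lambda_i - 1)(\lambda_j - 1)\, \bar\lambda_i^{\,n+p} \lambda_j^{\,n+q} \quad \text{as } n \to \infty, \tag{3.5} $$

for MPE, and

$$ u_{p,q} = (w_{n+p}, u_{n+q}) \sim \sum_{i=1}^{\infty} \sum_{j=1}^{\infty} (v_i, v_j)(\bar\lambda_i - 1)^2(\lambda_j - 1)\, \bar\lambda_i^{\,n+p} \lambda_j^{\,n+q} \quad \text{as } n \to \infty, \tag{3.6} $$

for RRE. (3.5) and (3.6) can be rewritten in the condensed and unified form

$$ u_{p,q} \sim \sum_{i=1}^{\infty} \sum_{j=1}^{\infty} z_{ij}\, \bar\lambda_i^{\,n+p} \lambda_j^{\,n+q} \quad \text{as } n \to \infty, \tag{3.7} $$

where we have defined

$$ z_{ij} = (v_i, v_j)(\bar\lambda_i - 1)(\lambda_j - 1), \quad i, j \ge 1, \tag{3.8} $$

for MPE, and

$$ z_{ij} = (v_i, v_j)(\bar\lambda_i - 1)^2(\lambda_j - 1), \quad i, j \ge 1, \tag{3.9} $$

for RRE.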

Note that when the sequence $x_j$, $j = 0, 1, \ldots$, is generated by a matrix
iterative process as described in the example of Section 2, then in (3.7) the
upper limits on the summations on the right hand side are replaced by M, and
~ is replaced by =.

In Lemma 3.1 and Lemma 3.2 below we derive the asymptotic expansions of
$D(x_n - s, \ldots, x_{n+k} - s)$ and $D(1, \ldots, 1)$, respectively, for $n \to \infty$, assuming
(2.1). In Theorem 3.1 we give the main result of this paper that shows that
both the MPE and RRE are true acceleration methods when applied to sequences of
vectors satisfying (2.1), in the sense that
$\lim_{n\to\infty} \|s_{n,k} - s\| / \|x_{n+k+1} - s\| = 0$.

In what follows we shall denote asymptotic relations of the form "$\alpha_n \sim \beta_n$
as $n \to \infty$" by "$\alpha_n \sim \beta_n$" for short. Also we shall let

$$ \sum_{i} = \sum_{i=1}^{\infty}, \qquad \sum_{i,j} = \sum_{i=1}^{\infty} \sum_{j=1}^{\infty}, \qquad \sum_{1 \le i_1 < i_2 < \cdots < i_k} = \sum_{i_1=1}^{\infty} \; \sum_{i_2=i_1+1}^{\infty} \cdots \sum_{i_k=i_{k-1}+1}^{\infty} $$

for simplicity.

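Lemma 3.1. Let the sequence of vectors $x_i$, $i = 0, 1, 2, \ldots$, be as
described in the previous section. For $i_p$, $j_p$ positive integers, define

$$ R^{i_1, \ldots, i_k}_{j_0, j_1, \ldots, j_k} = \begin{vmatrix} v_{j_0} & v_{j_1} & \cdots & v_{j_k} \\ z_{i_1 j_0} & z_{i_1 j_1} & \cdots & z_{i_1 j_k} \\ z_{i_2 j_0} & z_{i_2 j_1} & \cdots & z_{i_2 j_k} \\ \vdots & \vdots & & \vdots \\ z_{i_k j_0} & z_{i_k j_1} & \cdots & z_{i_k j_k} \end{vmatrix}, \tag{3.10} $$

where the interpretation of this determinant is like that of $D(\sigma_0, \sigma_1, \ldots, \sigma_k)$
in (2.13) and (2.14). Let $V(\xi_0, \xi_1, \ldots, \xi_k)$ be the Vandermonde determinant

$$ V(\xi_0, \xi_1, \ldots, \xi_k) = \begin{vmatrix} 1 & 1 & \cdots & 1 \\ \xi_0 & \xi_1 & \cdots & \xi_k \\ \vdots & \vdots & & \vdots \\ \xi_0^k & \xi_1^k & \cdots & \xi_k^k \end{vmatrix} = \prod_{0 \le i < j \le k} (\xi_j - \xi_i). \tag{3.11} $$

Then $D(x_n - s, \ldots, x_{n+k} - s)$ has the asymptotic expansion given by

$$ D(x_n - s, \ldots, x_{n+k} - s) \sim \sum_{1 \le j_0 < j_1 < \cdots < j_k} \; \sum_{1 \le i_1 < i_2 < \cdots < i_k} \Bigl( \prod_{p=0}^{k} \lambda_{j_p}^{n} \Bigr) V(\lambda_{j_0}, \lambda_{j_1}, \ldots, \lambda_{j_k}) \Bigl( \prod_{p=1}^{k} \bar\lambda_{i_p}^{\,n} \Bigr) V(\bar\lambda_{i_1}, \ldots, \bar\lambda_{i_k})\, R^{i_1, \ldots, i_k}_{j_0, j_1, \ldots, j_k}. \tag{3.12} $$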

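Proof. Let us denote $D(x_n - s, \ldots, x_{n+k} - s)$ by $P_n$ for short. By
(2.13), (2.1), and (3.7), we have

$$ P_n \sim \begin{vmatrix}
\sum_{j_0} v_{j_0} \lambda_{j_0}^{n} & \sum_{j_0} v_{j_0} \lambda_{j_0}^{n+1} & \cdots & \sum_{j_0} v_{j_0} \lambda_{j_0}^{n+k} \\
\sum_{i_1,j_1} z_{i_1 j_1} \bar\lambda_{i_1}^{\,n} \lambda_{j_1}^{n} & \sum_{i_1,j_1} z_{i_1 j_1} \bar\lambda_{i_1}^{\,n} \lambda_{j_1}^{n+1} & \cdots & \sum_{i_1,j_1} z_{i_1 j_1} \bar\lambda_{i_1}^{\,n} \lambda_{j_1}^{n+k} \\
\sum_{i_2,j_2} z_{i_2 j_2} \bar\lambda_{i_2}^{\,n+1} \lambda_{j_2}^{n} & \sum_{i_2,j_2} z_{i_2 j_2} \bar\lambda_{i_2}^{\,n+1} \lambda_{j_2}^{n+1} & \cdots & \sum_{i_2,j_2} z_{i_2 j_2} \bar\lambda_{i_2}^{\,n+1} \lambda_{j_2}^{n+k} \\
\vdots & \vdots & & \vdots \\
\sum_{i_k,j_k} z_{i_k j_k} \bar\lambda_{i_k}^{\,n+k-1} \lambda_{j_k}^{n} & \sum_{i_k,j_k} z_{i_k j_k} \bar\lambda_{i_k}^{\,n+k-1} \lambda_{j_k}^{n+1} & \cdots & \sum_{i_k,j_k} z_{i_k j_k} \bar\lambda_{i_k}^{\,n+k-1} \lambda_{j_k}^{n+k}
\end{vmatrix}. \tag{3.13} $$

By the multilinearity property of determinants we can reexpress (3.13) in
the form

$$ P_n \sim \sum_{j_0} \sum_{i_1,j_1} \cdots \sum_{i_k,j_k} v_{j_0} \Bigl( \prod_{p=1}^{k} z_{i_p j_p} \bar\lambda_{i_p}^{\,n+p-1} \Bigr) \Bigl( \prod_{p=0}^{k} \lambda_{j_p}^{n} \Bigr) V(\lambda_{j_0}, \lambda_{j_1}, \ldots, \lambda_{j_k}). \tag{3.14} $$

By changing the order of summation in (3.14), we have

$$ P_n \sim \sum_{i_1} \cdots \sum_{i_k} \Bigl( \prod_{p=1}^{k} \bar\lambda_{i_p}^{\,n+p-1} \Bigr) \sum_{j_0} \sum_{j_1} \cdots \sum_{j_k} v_{j_0} \Bigl( \prod_{p=1}^{k} z_{i_p j_p} \Bigr) \Bigl( \prod_{p=0}^{k} \lambda_{j_p}^{n} \Bigr) V(\lambda_{j_0}, \lambda_{j_1}, \ldots, \lambda_{j_k}). \tag{3.15} $$

Since $\bigl( \prod_{p=0}^{k} \lambda_{j_p}^{n} \bigr) V(\lambda_{j_0}, \lambda_{j_1}, \ldots, \lambda_{j_k})$ is odd under an interchange of any two
of the indices $j_p$, Lemma 2.1 can be applied, and we obtain

$$ P_n \sim \sum_{i_1} \cdots \sum_{i_k} \Bigl( \prod_{p=1}^{k} \bar\lambda_{i_p}^{\,n+p-1} \Bigr) \sum_{1 \le j_0 < j_1 < \cdots < j_k} \Bigl( \prod_{p=0}^{k} \lambda_{j_p}^{n} \Bigr) V(\lambda_{j_0}, \lambda_{j_1}, \ldots, \lambda_{j_k})\, R^{i_1, \ldots, i_k}_{j_0, j_1, \ldots, j_k}. \tag{3.16} $$

Interchanging the order of the summations in (3.16), we have

$$ P_n \sim \sum_{1 \le j_0 < j_1 < \cdots < j_k} \Bigl( \prod_{p=0}^{k} \lambda_{j_p}^{n} \Bigr) V(\lambda_{j_0}, \lambda_{j_1}, \ldots, \lambda_{j_k}) \sum_{i_1} \cdots \sum_{i_k} \Bigl( \prod_{p=1}^{k} \bar\lambda_{i_p}^{\,n+p-1} \Bigr) R^{i_1, \ldots, i_k}_{j_0, j_1, \ldots, j_k}. \tag{3.17} $$

Let now $N^{i_1, \ldots, i_k}_{j_l}$ be the cofactor of $v_{j_l}$ in the determinant expression for
$R^{i_1, \ldots, i_k}_{j_0, j_1, \ldots, j_k}$. Then

$$ R^{i_1, \ldots, i_k}_{j_0, j_1, \ldots, j_k} = \sum_{l=0}^{k} v_{j_l} N^{i_1, \ldots, i_k}_{j_l}. \tag{3.18} $$

Observing from (3.10) that the $N^{i_1, \ldots, i_k}_{j_l}$ are odd under an interchange of any
two of the indices $i_1, \ldots, i_k$, we can apply Lemma 2.1 again to the multiple
sum with respect to the indices $i_1, \ldots, i_k$, with $R^{i_1, \ldots, i_k}_{j_0, j_1, \ldots, j_k}$ replaced by
(3.18). The terms that are odd in the indices $i_1, \ldots, i_k$ now are
$N^{i_1, \ldots, i_k}_{j_l}$. By Lemma 2.1 then

$$ P_n \sim \sum_{1 \le j_0 < j_1 < \cdots < j_k} \Bigl( \prod_{p=0}^{k} \lambda_{j_p}^{n} \Bigr) V(\lambda_{j_0}, \lambda_{j_1}, \ldots, \lambda_{j_k}) \sum_{l=0}^{k} v_{j_l} \sum_{1 \le i_1 < i_2 < \cdots < i_k} \Bigl( \prod_{p=1}^{k} \bar\lambda_{i_p}^{\,n} \Bigr) V(\bar\lambda_{i_1}, \ldots, \bar\lambda_{i_k})\, N^{i_1, \ldots, i_k}_{j_l}, \tag{3.19} $$

which, by invoking (3.18), can be reexpressed as (3.12), thus completing the
proof. □

Lemma 3.2. Let the sequence of vectors $x_i$, $i = 0, 1, 2, \ldots$, be as
described in the previous section. For $i_p$, $j_p$ positive integers, define

$$ S^{i_1, \ldots, i_k}_{j_1, \ldots, j_k} = \begin{vmatrix} z_{i_1 j_1} & z_{i_1 j_2} & \cdots & z_{i_1 j_k} \\ z_{i_2 j_1} & z_{i_2 j_2} & \cdots & z_{i_2 j_k} \\ \vdots & \vdots & & \vdots \\ z_{i_k j_1} & z_{i_k j_2} & \cdots & z_{i_k j_k} \end{vmatrix}. \tag{3.20} $$

Then $D(1, \ldots, 1)$ has the asymptotic expansion given by

$$ D(1, \ldots, 1) \sim \sum_{1 \le j_1 < j_2 < \cdots < j_k} \; \sum_{1 \le i_1 < i_2 < \cdots < i_k} \Bigl( \prod_{p=1}^{k} \lambda_{j_p}^{n} \Bigr) V(1, \lambda_{j_1}, \ldots, \lambda_{j_k}) \Bigl( \prod_{p=1}^{k} \bar\lambda_{i_p}^{\,n} \Bigr) V(\bar\lambda_{i_1}, \ldots, \bar\lambda_{i_k})\, S^{i_1, \ldots, i_k}_{j_1, \ldots, j_k}. \tag{3.21} $$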


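Proof. Let us denote $D(1, \ldots, 1)$ by $Q_n$ for short. From (2.13) and
(3.7), we have

$$ Q_n \sim \begin{vmatrix}
1 & 1 & \cdots & 1 \\
\sum_{i_1,j_1} z_{i_1 j_1} \bar\lambda_{i_1}^{\,n} \lambda_{j_1}^{n} & \sum_{i_1,j_1} z_{i_1 j_1} \bar\lambda_{i_1}^{\,n} \lambda_{j_1}^{n+1} & \cdots & \sum_{i_1,j_1} z_{i_1 j_1} \bar\lambda_{i_1}^{\,n} \lambda_{j_1}^{n+k} \\
\sum_{i_2,j_2} z_{i_2 j_2} \bar\lambda_{i_2}^{\,n+1} \lambda_{j_2}^{n} & \sum_{i_2,j_2} z_{i_2 j_2} \bar\lambda_{i_2}^{\,n+1} \lambda_{j_2}^{n+1} & \cdots & \sum_{i_2,j_2} z_{i_2 j_2} \bar\lambda_{i_2}^{\,n+1} \lambda_{j_2}^{n+k} \\
\vdots & \vdots & & \vdots \\
\sum_{i_k,j_k} z_{i_k j_k} \bar\lambda_{i_k}^{\,n+k-1} \lambda_{j_k}^{n} & \sum_{i_k,j_k} z_{i_k j_k} \bar\lambda_{i_k}^{\,n+k-1} \lambda_{j_k}^{n+1} & \cdots & \sum_{i_k,j_k} z_{i_k j_k} \bar\lambda_{i_k}^{\,n+k-1} \lambda_{j_k}^{n+k}
\end{vmatrix}. \tag{3.22} $$

Again by the multilinearity property of determinants

$$ Q_n \sim \sum_{i_1,j_1} \cdots \sum_{i_k,j_k} \Bigl( \prod_{p=1}^{k} z_{i_p j_p} \bar\lambda_{i_p}^{\,n+p-1} \lambda_{j_p}^{n} \Bigr) V(1, \lambda_{j_1}, \ldots, \lambda_{j_k}). \tag{3.23} $$

By changing the order of summation in (3.23) and observing that
$\bigl( \prod_{p=1}^{k} \lambda_{j_p}^{n} \bigr) V(1, \lambda_{j_1}, \ldots, \lambda_{j_k})$ is odd under an interchange of any two of the
indices $j_p$, we can apply Lemma 2.1 to the summation over $j_1, \ldots, j_k$. The
result is

$$ Q_n \sim \sum_{i_1} \cdots \sum_{i_k} \Bigl( \prod_{p=1}^{k} \bar\lambda_{i_p}^{\,n+p-1} \Bigr) \sum_{1 \le j_1 < j_2 < \cdots < j_k} \Bigl( \prod_{p=1}^{k} \lambda_{j_p}^{n} \Bigr) V(1, \lambda_{j_1}, \ldots, \lambda_{j_k})\, S^{i_1, \ldots, i_k}_{j_1, \ldots, j_k}. \tag{3.24} $$

Changing the order of summation again, and observing that $S^{i_1, \ldots, i_k}_{j_1, \ldots, j_k}$
is odd under an interchange of any two of the indices $i_p$, we can now apply
Lemma 2.1 to the summation over $i_1, \ldots, i_k$. As a result, we obtain (3.21).
This completes the proof. □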



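Theorem 3.1. Let the sequence of vectors $x_i$, $i = 0, 1, 2, \ldots$, be as
described in the previous section. If, in addition, the $v_i$ are linearly
independent, and

$$ |\lambda_1| \ge \cdots \ge |\lambda_k| > |\lambda_{k+1}| \ge |\lambda_{k+2}| \ge \cdots, \tag{3.25} $$

then, for all sufficiently large n, $s_{n,k}$ exists, and

$$ s_{n,k} - s = F(n)\, \lambda_{k+1}^{n}\, [1 + o(1)] \quad \text{as } n \to \infty, \tag{3.26} $$

where the vector F(n) satisfies $\|F(n)\| \le K$, for some constant K > 0
independent of n, and for all sufficiently large n. If, in addition,

$$ |\lambda_{k+1}| > |\lambda_{k+2}|, \tag{3.27} $$

then

$$ F(n) = \frac{R^{1, \ldots, k}_{1, \ldots, k+1}}{S^{1, \ldots, k}_{1, \ldots, k}} \prod_{i=1}^{k} \Bigl( \frac{\lambda_{k+1} - \lambda_i}{\lambda_i - 1} \Bigr) [1 + o(1)] \quad \text{as } n \to \infty. \tag{3.28} $$

Proof. From (3.20) and (3.8) (or (3.9)),

$$ S^{1, \ldots, k}_{1, \ldots, k} = H\, G(v_1, \ldots, v_k), \tag{3.29} $$

where $H = \prod_{i=1}^{k} |\lambda_i - 1|^2$ for MPE (or $H = \prod_{i=1}^{k} (\bar\lambda_i - 1)^2 (\lambda_i - 1)$ for RRE),
and $G(v_1, \ldots, v_k)$ is the Gram determinant of the vectors $v_1, \ldots, v_k$, given
by

$$ G(v_1, \ldots, v_k) = \begin{vmatrix} (v_1, v_1) & (v_1, v_2) & \cdots & (v_1, v_k) \\ (v_2, v_1) & (v_2, v_2) & \cdots & (v_2, v_k) \\ \vdots & \vdots & & \vdots \\ (v_k, v_1) & (v_k, v_2) & \cdots & (v_k, v_k) \end{vmatrix}. \tag{3.30} $$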
Since the $v_i$ are linearly independent, $G(v_1, \ldots, v_k)$ is nonzero. Also,
since $\lambda_i \ne 1$ for all i, H is nonzero. Consequently, $S^{1, \ldots, k}_{1, \ldots, k}$ is nonzero.
Next, since $\lambda_i \ne \lambda_j$ for $i \ne j$, and $\lambda_i \ne 1$ for all i, $V(\bar\lambda_1, \ldots, \bar\lambda_k)$ and
$V(1, \lambda_1, \ldots, \lambda_k)$ are nonzero on account of (3.11). Finally, we observe that,
due to (3.25), the dominant term in the multiple sum on the right hand side of
(3.21) in Lemma 3.2 would be that for which $i_p = p$, $j_p = p$, $p = 1, \ldots, k$,
provided $V(1, \lambda_1, \ldots, \lambda_k)$, $V(\bar\lambda_1, \ldots, \bar\lambda_k)$, and $S^{1, \ldots, k}_{1, \ldots, k}$ are nonzero, which we
have already proved to be so. Consequently,

$$ D(1, \ldots, 1) = \Bigl( \prod_{p=1}^{k} |\lambda_p|^{2n} \Bigr) V(1, \lambda_1, \ldots, \lambda_k)\, V(\bar\lambda_1, \ldots, \bar\lambda_k)\, S^{1, \ldots, k}_{1, \ldots, k}\, [1 + o(1)] \quad \text{as } n \to \infty. \tag{3.31} $$

This also implies that, for all sufficiently large n, $D(1, \ldots, 1) \ne 0$, hence
$s_{n,k}$ exists. This completes the proof of the first part of the theorem.

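To prove (3.26), we should analyze the behavior of $D(x_n - s, \ldots,
x_{n+k} - s)$ for $n \to \infty$. By our assumptions made following (2.1), there is only a
finite number of $\lambda_i$ whose moduli are $|\lambda_{k+1}|$. Let $|\lambda_{k+1}| = \cdots = |\lambda_{k+r}| >
|\lambda_{k+r+1}|$. From this and (3.25), it follows that the dominant term on the right
hand side of (3.12), provided it is nonzero, is the sum of those terms with
indices $i_p = p$, $p = 1, \ldots, k$, $j_{p-1} = p$, $p = 1, \ldots, k$, $j_k = k + l$, $l = 1,
\ldots, r$, namely

$$ D(x_n - s, \ldots, x_{n+k} - s) = \Bigl( \prod_{p=1}^{k} |\lambda_p|^{2n} \Bigr) V(\bar\lambda_1, \ldots, \bar\lambda_k) \Bigl[ \sum_{l=1}^{r} \lambda_{k+l}^{n}\, V(\lambda_1, \ldots, \lambda_k, \lambda_{k+l})\, R^{1, \ldots, k}_{1, \ldots, k, k+l} \Bigr] [1 + o(1)] \quad \text{as } n \to \infty. \tag{3.32} $$

From (3.10) and (3.20),

$$ R^{1, \ldots, k}_{1, \ldots, k, k+l} = \sum_{i=1}^{k} (-1)^{i-1} v_i\, S^{1, \ldots, k}_{1, \ldots, i-1, i+1, \ldots, k, k+l} + (-1)^k v_{k+l}\, S^{1, \ldots, k}_{1, \ldots, k}, \tag{3.33} $$

so that the summation on the right hand side of (3.32) becomes

$$ \sum_{i=1}^{k} v_i \Bigl[ \sum_{l=1}^{r} (-1)^{i-1} S^{1, \ldots, k}_{1, \ldots, i-1, i+1, \ldots, k, k+l}\, \lambda_{k+l}^{n}\, V(\lambda_1, \ldots, \lambda_k, \lambda_{k+l}) \Bigr] + (-1)^k S^{1, \ldots, k}_{1, \ldots, k} \sum_{l=1}^{r} v_{k+l}\, \lambda_{k+l}^{n}\, V(\lambda_1, \ldots, \lambda_k, \lambda_{k+l}). \tag{3.34} $$

Since $S^{1, \ldots, k}_{1, \ldots, k} \ne 0$, $V(\lambda_1, \ldots, \lambda_k, \lambda_{k+l}) \ne 0$, and the vectors $v_i$ are
linearly independent, the second summation in (3.34) is never zero. This proves
(3.32). Combining (3.31) and (3.32), (3.26) follows. If (3.27) holds, then
r = 1. In this case (3.28) follows from (3.31), (3.32), and (3.11). This
completes the proof of the theorem. □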

The asymptotic error analysis of the MPE and RRE as given in Theorem 3.1
leads us to the following important conclusions:

(1) Under the conditions stated in the theorem, the MPE and RRE are bona
fide acceleration methods in the sense that

$$ \frac{\|s_{n,k} - s\|}{\|x_{n+k+1} - s\|} = O\Bigl( \Bigl| \frac{\lambda_{k+1}}{\lambda_1} \Bigr|^{n} \Bigr) = o(1) \quad \text{as } n \to \infty. \tag{3.35} $$

This means that if $x_n \to s$ as $n \to \infty$, i.e., $|\lambda_1| < 1$, then $s_{n,k} \to s$ as
$n \to \infty$, and more quickly. Also if $\lim_{n\to\infty} x_n$ does not exist, i.e., $|\lambda_1| \ge 1$, then
$s_{n,k} \to s$ as $n \to \infty$, provided that $|\lambda_{k+1}| < 1$. The reason that we write
$x_{n+k+1}$ in (3.35) is that $s_{n,k}$ in both the MPE and RRE is computed from
$x_n, x_{n+1}, \ldots, x_{n+k+1}$.

(2) When the MPE and RRE are being applied to a vector sequence generated
by using a matrix iterative method, they will be especially effective when the
iteration matrix has a small number of large eigenvalues (k-many when $s_{n,k}$ is
being used) that are well separated from the small eigenvalues.

(3) As can be seen from (3.28), a loss of accuracy will take place in
$s_{n,k}$ when $\lambda_1, \ldots, \lambda_k$ are close to 1. For sequences of vectors obtained
from the iterative solution of linear systems of equations, this means that the
matrix of the system is nearly singular.

These conclusions are the same as those for the modified MPE, which has 
been defined and analyzed in [3]. 
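The rate predicted by (3.26) can be observed numerically. The following is a minimal sketch, assuming NumPy and a hypothetical diagonal test problem with eigenvalues 0.9, 0.7, 0.3, 0.1 and k = 2, so that $\lambda_{k+1} = 0.3$; the helper `mpe` is an illustrative least squares implementation of (2.5)-(2.8), not code from the original work.

```python
import numpy as np

def mpe(X, k):
    """MPE approximation s_{n,k} from the columns x_n, ..., x_{n+k+1} of X."""
    U = np.diff(X[:, :k + 2], axis=1)            # u_n, ..., u_{n+k}
    c, *_ = np.linalg.lstsq(U[:, :k], -U[:, k], rcond=None)
    c = np.append(c, 1.0)                        # c_k = 1
    return X[:, :k + 1] @ (c / c.sum())

lam = np.array([0.9, 0.7, 0.3, 0.1])             # hypothetical eigenvalues
A = np.diag(lam)
b = np.ones(4)
s = b / (1.0 - lam)                              # solution of x = A x + b
k = 2

xs = [s + np.ones(4)]                            # x_0 - s has all components equal to 1
for _ in range(16):
    xs.append(A @ xs[-1] + b)

errs = []
for n in range(13):
    X = np.column_stack(xs[n:n + k + 2])         # x_n, ..., x_{n+k+1}
    errs.append(np.linalg.norm(mpe(X, k) - s))
ratios = np.array(errs[1:]) / np.array(errs[:-1])
print(ratios[-3:])                               # successive error ratios
```

For this problem the successive error ratios should settle near $|\lambda_3| = 0.3$, whereas the unaccelerated iterates contract only by a factor of about 0.9 per step.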

Finally note that the results of this section (and of Section 4) will not
change if the MPE and RRE are replaced by any other method giving rise to
(3.1), with $D(\sigma_0, \ldots, \sigma_k)$ defined as in (2.13), as long as the $u_{i,j}$ in
(2.13) satisfy (3.7) with $z_{ij} = (v_i, v_j)\, \mu_i \nu_j$, where $\mu_i$ and $\nu_j$ are fixed
nonzero constants.

4. STABILITY ANALYSIS 

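Let us denote $\gamma_j$ in (2.5) by $\gamma_j^{(n,k)}$ for both MPE and RRE. We say that
$s_{n,k}$ is asymptotically stable if

$$ \sup_{n} \sum_{j=0}^{k} \bigl| \gamma_j^{(n,k)} \bigr| < \infty. \tag{4.1} $$

Roughly speaking, this means that if errors are introduced in the vectors $x_m$,
then the resulting error in $s_{n,k}$ remains bounded as $n \to \infty$. Since
$\sum_{j=0}^{k} \gamma_j^{(n,k)} = 1$, the most ideal situation is one in which
$\gamma_j^{(n,k)} \ge 0$, $0 \le j \le k$, for all sufficiently large n, so that
$\sum_{j=0}^{k} |\gamma_j^{(n,k)}| = \sum_{j=0}^{k} \gamma_j^{(n,k)} = 1$.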
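The role of $\sum_j |\gamma_j^{(n,k)}|$ as an amplification factor can be seen from the triangle inequality applied to (2.5). The following is a minimal sketch, assuming NumPy, and assuming for simplicity that the weights are held fixed while the vectors $x_{n+j}$ are perturbed; the particular weights in `gamma` and the perturbation size `eps` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(5)
gamma = np.array([21.0, -160.0 / 3.0, 100.0 / 3.0])   # hypothetical weights with sum 1
amplification = np.abs(gamma).sum()              # the quantity bounded in (4.1)

eps = 1e-8
k, M = gamma.size - 1, 4
# Perturbations delta_j of x_{n+j}, each scaled to have norm eps
delta = rng.standard_normal((M, k + 1))
delta *= eps / np.linalg.norm(delta, axis=0)

error_in_s = np.linalg.norm(delta @ gamma)       # change in sum_j gamma_j x_{n+j}
bound = eps * amplification                      # triangle inequality bound
print(error_in_s, bound)
```

Weights of mixed sign and large magnitude, as in this example, give a bound roughly a hundred times `eps`; nonnegative weights would give a bound equal to `eps`.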
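The following theorem gives the stability properties of both the MPE and
RRE when they are applied to vector sequences satisfying the conditions of
Theorem 3.1. $s_{n,k}$ denotes either that obtained from MPE or RRE.

Theorem 4.1. For vector sequences satisfying the conditions stated in
Theorem 3.1, $s_{n,k}$ is asymptotically stable. Actually, the following are true:

(1) The $\gamma_q^{(n,k)}$ satisfy

$$ \gamma_q^{(n,k)} = \frac{C_q(\lambda_1, \ldots, \lambda_k)}{V(1, \lambda_1, \ldots, \lambda_k)}\, [1 + o(1)] \quad \text{as } n \to \infty, \tag{4.2} $$

where

$$ C_q(\xi_1, \ldots, \xi_k) = (-1)^q \begin{vmatrix} 1 & \xi_1 & \cdots & \xi_1^{q-1} & \xi_1^{q+1} & \cdots & \xi_1^{k} \\ 1 & \xi_2 & \cdots & \xi_2^{q-1} & \xi_2^{q+1} & \cdots & \xi_2^{k} \\ \vdots & \vdots & & \vdots & \vdots & & \vdots \\ 1 & \xi_k & \cdots & \xi_k^{q-1} & \xi_k^{q+1} & \cdots & \xi_k^{k} \end{vmatrix}, \tag{4.3} $$

thus

$$ \sum_{q=0}^{k} \bigl| \gamma_q^{(n,k)} \bigr| = \frac{\sum_{q=0}^{k} |C_q(\lambda_1, \ldots, \lambda_k)|}{|V(1, \lambda_1, \ldots, \lambda_k)|}\, [1 + o(1)] \quad \text{as } n \to \infty, \tag{4.4} $$

i.e., (4.1) holds.

(2) If $\lambda_1, \ldots, \lambda_k$ are real and negative, then

$$ \gamma_q^{(n,k)} > 0, \quad 0 \le q \le k, \quad \text{for n sufficiently large}, \tag{4.5} $$

thus

$$ \sum_{q=0}^{k} \bigl| \gamma_q^{(n,k)} \bigr| = 1 + o(1) \quad \text{as } n \to \infty. \tag{4.6} $$

(3) If $\lambda_1, \ldots, \lambda_k$ are real and positive, then

$$ \gamma_q^{(n,k)} \gamma_{q+1}^{(n,k)} < 0, \quad 0 \le q \le k-1, \quad \text{for n sufficiently large}, \tag{4.7} $$

thus

$$ \sum_{q=0}^{k} \bigl| \gamma_q^{(n,k)} \bigr| = \prod_{i=1}^{k} \Bigl| \frac{1 + \lambda_i}{1 - \lambda_i} \Bigr|\, [1 + o(1)] \quad \text{as } n \to \infty. \tag{4.8} $$

(4) For any $\lambda_1, \ldots, \lambda_k$,

$$ \lim_{n \to \infty} \sum_{q=0}^{k} \gamma_q^{(n,k)} \lambda^q = \prod_{i=1}^{k} \frac{\lambda - \lambda_i}{1 - \lambda_i}. \tag{4.9} $$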




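Proof. Since $\gamma_q^{(n,k)}$ is given by (2.12), and we already know the
asymptotic behavior of $D(1, \ldots, 1)$ from Lemma 3.2, it is sufficient to analyze
$N_q$ asymptotically. By deleting the first row and the (q + 1)st column of the
determinant in (3.22), after some manipulation, we have

$$ N_q \sim \sum_{i_1,j_1} \cdots \sum_{i_k,j_k} \Bigl( \prod_{p=1}^{k} z_{i_p j_p} \bar\lambda_{i_p}^{\,n+p-1} \lambda_{j_p}^{n} \Bigr) C_q(\lambda_{j_1}, \ldots, \lambda_{j_k}). \tag{4.10} $$

Observe that $\bigl( \prod_{p=1}^{k} \lambda_{j_p}^{n} \bigr) C_q(\lambda_{j_1}, \ldots, \lambda_{j_k})$ is odd under an interchange of
any two of the indices $j_p$. Consequently,

$$ N_q \sim \sum_{1 \le j_1 < j_2 < \cdots < j_k} \; \sum_{1 \le i_1 < i_2 < \cdots < i_k} \Bigl( \prod_{p=1}^{k} \lambda_{j_p}^{n} \Bigr) C_q(\lambda_{j_1}, \ldots, \lambda_{j_k}) \Bigl( \prod_{p=1}^{k} \bar\lambda_{i_p}^{\,n} \Bigr) V(\bar\lambda_{i_1}, \ldots, \bar\lambda_{i_k})\, S^{i_1, \ldots, i_k}_{j_1, \ldots, j_k} \tag{4.11} $$

follows from (4.10) in exactly the same way (3.21) follows from (3.23).
Invoking (3.25),

$$ N_q = \Bigl( \prod_{p=1}^{k} |\lambda_p|^{2n} \Bigr) V(\bar\lambda_1, \ldots, \bar\lambda_k)\, C_q(\lambda_1, \ldots, \lambda_k)\, S^{1, \ldots, k}_{1, \ldots, k}\, [1 + o(1)] \quad \text{as } n \to \infty \tag{4.12} $$

follows from (4.11) in the same way (3.31) follows from (3.21). Combining
(4.12) and (3.31) in (2.12), (4.2) follows. (4.4) follows directly from (4.2).
This proves (1).

Note that $V(\lambda, \lambda_1, \ldots, \lambda_k)$ is a polynomial of degree k in $\lambda$. From
(3.11) and (4.3) we have

$$ V(\lambda, \lambda_1, \ldots, \lambda_k) = \sum_{q=0}^{k} C_q(\lambda_1, \ldots, \lambda_k)\, \lambda^q. \tag{4.13} $$

Since $V(\lambda, \lambda_1, \ldots, \lambda_k)$ has $\lambda_1, \ldots, \lambda_k$ as its only zeros, we also have

$$ V(\lambda, \lambda_1, \ldots, \lambda_k) = C_k(\lambda_1, \ldots, \lambda_k) \prod_{i=1}^{k} (\lambda - \lambda_i). \tag{4.14} $$

If $\lambda_1, \ldots, \lambda_k$ are real and negative, then (4.13) and (4.14) imply that
$C_q(\lambda_1, \ldots, \lambda_k)$, $0 \le q \le k$, are all of the same sign. This, along with (4.2),
implies (4.5). (4.6) is a direct consequence of (4.5) and (2.11). This
proves (2).

If $\lambda_1, \ldots, \lambda_k$ are real and positive, then (4.13) and (4.14) imply that
$C_q(\lambda_1, \ldots, \lambda_k)\, C_{q+1}(\lambda_1, \ldots, \lambda_k) < 0$, $0 \le q \le k-1$. This, along with (4.2),
implies (4.7) and

$$ \sum_{q=0}^{k} \bigl| \gamma_q^{(n,k)} \bigr| = \frac{\bigl| \sum_{q=0}^{k} (-1)^q C_q(\lambda_1, \ldots, \lambda_k) \bigr|}{|V(1, \lambda_1, \ldots, \lambda_k)|}\, [1 + o(1)] = \frac{|V(-1, \lambda_1, \ldots, \lambda_k)|}{|V(1, \lambda_1, \ldots, \lambda_k)|}\, [1 + o(1)] \quad \text{as } n \to \infty, \tag{4.15} $$

which, by (3.11), reduces to (4.8). This proves (3).

Multiplying both sides of (4.2) by $\lambda^q$, summing over q from 0 to k,
and finally making use of (4.13) and (3.11), (4.9) follows, thus proving (4).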


□ 
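The limits in Theorem 4.1 can be compared with computed weights. The following is a minimal sketch, assuming NumPy and a hypothetical diagonal test problem with positive eigenvalues 0.9, 0.7, 0.3, 0.1 and k = 2; the weights are computed by the least squares form of the MPE, and the limiting weights are read off as the coefficients of $\prod_i (\lambda - \lambda_i)/(1 - \lambda_i)$, as suggested by (4.9).

```python
import numpy as np

lam = np.array([0.9, 0.7, 0.3, 0.1])             # hypothetical eigenvalues, all positive
k, n = 2, 10
A, b = np.diag(lam), np.ones(4)
s = b / (1.0 - lam)                              # solution of x = A x + b

xs = [s + np.ones(4)]
for _ in range(n + k + 1):
    xs.append(A @ xs[-1] + b)
U = np.diff(np.column_stack(xs[n:n + k + 2]), axis=1)   # u_n, ..., u_{n+k}

c, *_ = np.linalg.lstsq(U[:, :k], -U[:, k], rcond=None)
c = np.append(c, 1.0)
gamma = c / c.sum()                              # gamma_q^{(n,k)} for MPE, q = 0..k

# Coefficients (lowest power first) of prod_i (lam - lam_i) / (1 - lam_i), i = 1..k
lead = lam[:k]
gamma_limit = np.poly(lead)[::-1] / np.prod(1.0 - lead)

sum_abs = np.abs(gamma).sum()
predicted = np.prod((1.0 + lead) / (1.0 - lead)) # right-hand side of (4.8)
print(gamma, gamma_limit)
print(sum_abs, predicted)
```

For these eigenvalues the weights alternate in sign and their absolute sum is a little over one hundred, which illustrates how eigenvalues close to 1 inflate the amplification factor even though the method remains asymptotically stable.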